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Abstract 

We introduce a simple and generic model that reproduces Zipf's 
law. By regarding the time evolution of the model as a random walk 
in the logarithmic scale, we explain theoretically why this model re- 
produces Zipf's law. The explanation shows that the behavior of the 
model is very robust and universal. 
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1 Introduction 

Zipf's law is the observation that the frequency P of the occurrence of various 
events is an inverse power-law function P(> s) oc s _1 , where the rank s is 
determined by the descending order of the frequency of each event. IU Zipf 
first made this observation for the frequency of occurrence of English words in 
literature; that is, the most frequent word "the" (rank s = 1) appears twice 
as many times as the second most frequent word "of" (rank s = 2). Since 
then, Zipf's law has been found in various fields, including the population 
of cities0 and the asset distribution of companies. 0, £|] For example, Zipf's 
law for income distribution asserts that the size and number of companies 
are in inverse proportion; that is, the number of companies that have more 
than lOx assets is one-tenth of the number of companies that have more 
than x. This universality of Zipf's law, however, has not been well explained 
theoretically. To date, we have only models specific to each problem. || 

In the present paper, we introduce a simple and generic model that re- 
produces Zipf's law. We can regard this model both as the time evolution of 
the asset distribution and that of the population of cities. We also explain 
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theoretically why this model reproduces Zipf's law. Our explanation shows 
that Zipf's law of our model is very robust. 

2 Model and Simulation Results 

In this section we introduce our model and the results of its simulation. The 
model that we introduce here evolves as follows: First, we consider a set 
of positive values Xi with N entities. These values may be the assets of N 
companies or the population of iV cities. For simplicity, we assume that the 
initial values of Xj are all equal. Then we repeat the following procedures T 
times: 

1. Choose an entity i randomly from 1 < i < N. 

2. For i > 1, we move the amount axi-i from the (i — l)th entity to the 
ith entity, where a is a constant parameter with < a < 1. In other 
words 

Xi-i — > Xi_i - axi-x (1) 
Xi — > Xi + axi-i. (2) 

This procedure may be regarded as a deal between two companies or 
population movement between two cities. For % = 1, we increase all 
values Xi by the same amount x a/N. 

3. Rearrange the entities in the descending order of the values Xj. 

Let us explain this model from the viewpoint of asset distribution. We 
focus on the hierarchical structure of the companies. Assume that a company 
will tend to trade with a company of similar size. That is, money flows 
from one company to a smaller company and the smaller company delivers 
products to the larger company. Hence, in our model, money flows from 
high-rank companies to low-rank companies as in Eqs.(|l]) and (^|). 

Figure [1] shows results of the simulation of the above procedures for vari- 
ous values of a. Here the number of the companies is iV = 10 4 . In every case, 
the model reproduces Zipf's law as T — > oo. We see that the parameter a is 
related to the rapidity of the convergence to Zipf's law. In Fig.|l|(a) (a = 0.1) 
the distribution evolves slowly, but eventually converges to Zipf's law. 
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Figure 1: The cumulative asset distribution of our model: (a) a = 0.1; (b) 
a = 0.3; (c) a = 0.99. In each plot, the number of steps is T = 10 5 (dotted 
line), 10 6 (dashed line) and 10 7 (bold line). We normalized the results by the 
asset of the highest entity. The straight line indicates the power law with 
exponent —1, namely Zipf's law. 



3 Continuum Limit and Universality 



In this section we give a theoretical explanation as to why the model in 
the previous section reproduces Zipf's law. Let us consider the limit N — > 
oo. Then, the value of the ith entity is nearly equal to the value of the 
(i — l)th entity, that is Xi-i ~ X{. Hence we regard the set of variables X{ 
as a continuous variable x. We consider in this limit the time evolution of 
the probability distribution function of x. We note that Eqs.(|l]) and (Q) give 
probability flows in the directions 

x — > x — ax = (1 — a)x, (3) 
x — > x + ax = (1 + a)x. (4) 

The evolution equation of P(x) is therefore the following: 

JW t) = -2jP(x, t) + _^P( * t) + -J7_f>( * f), (5) 
at \ — a. 1 — a 1 + + a 

where t) is the probability distribution of x at time i and 7 is a certain 
constant. The first term is a flow from the point x to the points (1 ± a)x, 
while the second and third terms are flows from x/(l ± a) into x. The 
coefficients 1/(1 ± a) in front of P in Eq.fl5|) are necessary in order to satisfy 
the probability conservation. To see this, we integrate Eq.(|5]) over x as 



d_ 

dtJo 



/ P{x,t)dx = -27/ P{x,t)dx+ — — / P{ ,t)dx 

Jo Jo 1 — a Jo 1 — a 

1 + aJo v 1 + o ■ 1 K ' 



Changing the variables to be 



1 x n x 

x = 1 , x = T-; — , (7) 

1 — a 1 + a 



we can rewrite Eq.(|6]) as 




roc poo poo 

— I P(x,t)dx = -27/ P(x,t)dx + 'y P(x',t)dx' 
at Jo Jo Jo 

POO 

+ 7 / P(x",t)dx" = 0. 
Jo 

Thus, the total probability is conserved. 
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To see Eq. (^) from a different point of view, we change the variable to be 
£ = logx. The probability distribution function is transformed to 

P(x, t)dx = P(e 5 , t)e^di = P{£, t)d£. (8) 

In other words, we define 

p(£, t) = P(x, t)x = P(e ? , t)e 5 . (9) 

By rewriting P in terms of P as 

P(z,t) = e-«P(£,t) = ip(logx,t), (10) 

P{t— — '*) = ^i 3 (log T ^,t) = ^P(£-log(l T a),t),(ll) 

we transform the evolution equation fl5|) to 

^P(£,t) = -2 7 Pfot) + 7 P^ + /M) + 7% -/?-,*), (12) 

where 

P ± = T log(l T a)>0. (13) 

We can see the evolution equation ( p!2|) as a random walk from £ to 
(£ ± /3±), that is, a random walk with a fixed step size. Although we can 
solve Eq. (O) exactly, it is obvious even without solving it that the stationary 
solution is given by P(£, t) =constant as t — > oo. Changing the variable back 
to x = e^, we have P(x, t) oc x^ 1 as t — > oo, or Zipf 's law. 

From the above viewpoint, the resulting distribution always obeys Zipf's 
law as long as the diffusion of x is uniform in the logarithmic scale. We can 
thus generalize the model in §2 extensively. We assumed in §2 that a chosen 
entity is given some value from the entity one rank higher in the model, but 
this is not necessary. The model reproduces Zipf's law as long as there is a 
flow axi proportional to its size. For example, the following procedure ("the 
nonconservative model") also reproduces Zipf's law: 

1. Choose an entity % randomly from 1 < i < N. 

2. Give or reduce randomly the amount «Xj from the chosen entity: 

X{ — > Xi ± axi. (14) 
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The simulation of the above procedure yields Fig.^(a). 

It is even unnecessary to make a a fixed value; Choosing a randomly 
from < a < 0.99 at each step in this nonconservative model, we obtain 
similar results to those shown in Fig.^(b) . In this case, however, we need to 
bind the diffusion of x\ otherwise, P(x, t) — > as t — > oo. In Fig.||(b), we set 
the limit 1(T 4 < x < 10°. 

4 Summary 

In this paper, we explained the universality of Zipf's law theoretically and 
demonstrated it numerically. The simulation results of our simple and generic 
model indicate that the present explanation of Zipf's law is applicable to 
various phenomena, both natural and social. This paper does not consider 
the most famous case of Zipf's law, the frequency of English words. We 
speculate, however, that Zipf's law of English words might be explained by 
the same time development as in this model. 

References 

[1] G. K. Zipf: Human Behavior and the Principle of Least Effort (Addison- 
Wesley, Cambridge, 1949). 

[2] For example, M. Marsili and Y.-C. Zhang: Phys. Rev. Lett. 80 (1998) 
2741. 

[3] K. Okuyama, M. Takayasu and H. Takayasu: Physica A 269 (1999) 125. 

[4] H. Takayasu, M. Takayasu, M. P. Okazaki, K. Marumo and T. Shimizu: in 
Paradigms of Complexity, ed. M. M. Novak (World Scientific, Singapore 
2000) p.243. 



6 




Figure 2: The results of the simulations of the nonconservative model: (a) 
a = 0.3; (b) randomly chosen a. In each plot, the number of steps is T = 10 5 
(dotted line), 10 6 (dashed line), 10 7 (bold line). We normalized the results 
by the asset of the highest entity. The straight line indicates the power law 
with exponent —1, Zipf's law. 
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